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ABSTRACT 

Using stellar kinematics measurements, we investigate whether massive, quiescent galaxies were denser at 
Z ~2 than they are today. We present X-Shooter spectra from the UV to NIR and dynamical mass measurements 
of 5 quiescent massive (> 10 n M ) galaxies at z ~ 2. This triples the sample of z > 1.5 galaxies with well 
constrained (6cr < 100km s _1 ) velocity dispersion measurements. From spectral population synthesis modeling 
we find that these galaxies have stellar ages that range from 0.5-2 Gyr, with no sign of on-going star formation. 
We measure velocity dispersions (290-450 km s ) and find that they are 1.6-2.1 times higher than those of 
galaxies in the SDSS at fixed mass. Sizes are measured using GALFIT from HST-WFC3 Z/160 and UDS ri- 
band images. The dynamical masses correspond well to the SED-based stellar masses, with dynamical masses 
that are ~ 15% higher. We find that M»/M^ vn may decrease slightly with time, which could reflect the increase 
of the dark matter fraction within an effective radius. We combine different stellar kinematic studies from the 
literature, and examine the structural evolution from z ~ 2 to z ~ 0: we confirm that at fixed dynamical mass, 
the effective radius increases by a factor of ~ 2.8, and the velocity dispersion decreases by a factor of ~ 1.7 
with time. The mass density within one effective radius decreases by a factor of ~ 21, while within a fixed 
physical radius (1 kpc) it decreases only mildly (factor of ~ 2.3). When we allow for an evolving mass limit 
by selecting a population of galaxies at fixed number density, a stronger size growth with time is found (factor 
of ~ 4), velocity dispersion decreases by a factor of ~ 1 .4, and interestingly, the mass density within 1 kpc 
is consistent with no evolution. This finding suggests that massive quiescent galaxies at z ~ 2 grow in an 
inside-out matter, consistent with the expectations from minor mergers. 
Subject headings: cosmology: observations — galaxies: evolution — galaxies: formation 



1. INTRODUCTION 

Recent studies have shown that a considerable fraction of 
massive galaxies at 1 .5 < z < 2.5 h a ve quiescent stel- 
lar populations (e.g lLabbe et al.1 120051: iKriek et al.l 120061: 
I Williams et al.l 120091) . Among the most massive galaxies 
(M, > 10 11 M p ) approximately 40% are no longer form - 
ing stars (e.g. fwhitaker et al.1 1201 It iBrammer et al.1 1201 ll) . 
Surprisingly, these massive quiescent g alaxies have been 
found to be extrem ely compact (e.g. iDaddi et al.l 120051: 
Truiill o et al.l 120061: Ivan Dokkum et all l2008t lFranxetal.1 
2008; Ivan der Wei et al.l 120081: and numerous others), com- 
pared to their likely present-day counterparts. 

Searches for any ultra- dense low-red s hift co unterparts by 
ITrujillo et al.l §009) and iTaylor et al.1 (l2010al) found only 
a handful of compact sources at z ~ 0, tha t have 
relatively young stella r populations (ITrujillo et al.1 120091: 
iFerre-Mateu et al.ll2.012b . The dearth of massive, old compact 
objects at low redshift implies that massive galaxies must have 
undergone severe structural evolution in size. 
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Errors in the size estimates have been invoked as a pos- 
sible explanation for the compactness of massive high- 
redshift galaxies. Initial concerns that the size may have 
been underestimated, due to an envelope of low sur- 
face brightness l ight, have been addressed with deep HST 
WFC3 imaging dSzomoruet all 120 10t 120121), and by stack- 
ing results (e.g Ivan der Wei et al.1 120081: ICassata et alj|2010t 
Ivan Dokkum et al.l 120081 I2010I) . The light could also be 
more concentrated due to the presence of active galactic nu- 
clei (AGN) in these galaxies. However, spectra of these 
galaxies have shown that the light is domin ate d by e volved 
stellar populations not AGN (IKriek et alJ 120061 120091: 
Ivan de Sande et al.ll201 UlOnodera et al.ll2012l) . 

The question of whether stellar masses are accurate out to 
z ~ 2 remains, however, of paramount importance: an over- 
estimate in stellar mass would bring the galaxies closer to the 
z ~ mass-size relation. To date, basically all (stellar) masses 
have been derived by fitting the spectral energy distributions. 
This method suffers from many systematic un certainties in 
stella r population synthes is models (SPS) (e.g IConrov et al.l 
120091: iMuzzin et al.| [2009) and is essentially untested at z > 
1.5. 

Direct stellar kinematic mass measurements, which do not 
suffer from these uncertainties, can be derived by measuring 
the galaxy's velocity dispersion and the shape and extent of 
its luminosity profile, i.e. the Sersic n parameter and effective 
radius. In particular , for low-redshift galaxies in the SDSS, 
Taylor et al.l ((2010b) showed that stellar mass is a very good 
predictor of dynamical mass, but only when non-homology 
of luminosity profile is properly acc ounted for if we includ e 
a Sersic dependent virial factor (e.g. ICappellari et al.l 2006). 
Although dynamical measurements of massive galaxies are 
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Fig. 1. — Comparison of our spectroscopic sample to the full population at similar redshift. Squares represent the density of galaxies from the NMBS-I and 
UDS at 1.4 < z < 2.1 with mass > 10 I0 5 M Q . Left panel: Rest-frame U-V and V-J colors. Color coding is based upon the sSFR derived from SED fitting, red 
colors indicates low sSFR (quiescent), and blue colors high sSFR(star-forming). Gal axies in the to p left region, as marked by the black line, all have low sSFR 
rates. This region is therefore often used to select quiescent galaxies at high redshift (Williams et al. 2009). All but one of our galaxies fall within this region, but 
their sSFR indicate that they are al l truly quiescent. The vertical dotted line discriminates between young post-starburst like (left) versus old quiescent (right) as 
indicated by Whitaker et al. 12012). The strong Balmer absorption lines spectroscopically confirm the young ages of this sample. Middle panel: Rest-frame U-V 
versus stellar mass. At fixed mass, we find that most of our galaxies have similar colors to the entire population, except for NMBS-COS7447 and UDS-19627 on 
the blue side. Right panel: //-band aperture magnitude versus stellar mass. It is clear that our sample was selected on magnitude, and at fixed mass they are truly 
among the brightest galaxies, consistent with their post-starburst nature. 
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Targets and Observations 
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M yr" 1 


NMBS-COS 


7447 


10:00:06.96 


02:17:33.77 


120 


21.09 


20.72 


20.63 


19.64 


1.20 


0.31 


< 18 


< 13 


NMBS-COS 


18265 


10:00:40.83 


02:28:52.15 


90 


22.67 


20.85 


20.61 


19.62 


1.71 


0.86 


< 18 


< 15 


NMBS-COS 


7865 


10:00:17.73 


02:17:52.75 


434 


22.75 


21.51 


21.07 


20.02 


1.78 


0.89 


i 18 


i 19 


UDS 


19627 


02:18:17.06 


-05:21:38.83 


300 


21.40 


20.91 


20.65 


20.19 


1.37 


0.68 


< 30 


< 29 


UDS 


29410 


02:17:51.22 


-05:16:21.84 


120 


20.59 


20.18 


19.81 


19.36 


1.61 


0.95 


< 30 


< 22 



common at low redshift, spectroscopic studies become much 
more difficult at higher redshift as the bulk of the light, and 
absorption features used to measure kin ematics, shift red- 
ward into the near- infrared (MR) (e.g. iKriek et al.1 2009; 
Ivan Dold<:umlFaIll2009l) . 

New technology such as the new red arm of the LRIS spec- 
trograph at Keck (working beyond lyum) mak es it possible to 
measure velocity dispersion s up to z ~ 1.5 dNewman et alj 
120 101 iBezanson et al.l 120121) . Deep NIR spectroscopy is, 
however, required to push stellar kinematic studies to even 
higher redshift. From a ~29hr spectrum of an ultra- 
compact galaxy at z — 2 . 2 obt ai ned with Gemini Near-IR 
Spectrograph dKriek et alj 12009b . Ivan Dokkum et all d2009l) 
found a high, though uncertain, velocity dispersion of cr — 
5 10! If km s -1 . lOnodera et all (120121) used the MOIRCS 
on the Subaru telescope to observe the rest-frame optical 
spectrum of a less-compact, passive, ultra-massive galaxy 
at z — 1.82, but the low spectral resolution and signal-to- 
noise severely limited the accuracy of their velocity disper- 
sion: o- = 270+ 105km s" 1 . X-Shooter (iD'Odorico et al.1 
2006), the new UV to NIR spectrograph at the VLT, can pro- 
vide the required S/N and resolution. The capabilities of X- 
Shooter for this kind of m easurements were demonstrated in 
Ivan de Sande et al.1 (120111) , who fo und 294 + 51km s _1 for a 
massive quiescent galaxy at z = 1 . 8 . iToft et all (120 121) also use 
X-Shooter and present a dynamical measurement of a galaxy 
at redshift z=2.04 with similar results. Taken all together, 
these results indicate that the dynamical and stellar masses 
are consistent with z ~ 0. With the small number of measure- 



ments beyond z > 1 .5, however, the sample is still too small to 
draw any firm conclusions on whether the stellar masses are 
truly reliable. 

Here we present a sample of 5 massive quiescent galax- 
ies with high Signal-to-Noise (S/N), medium-resolution, UV- 
NIR spectra at 1.4 < z < 2.1 observed with X-Shooter on the 
VLT. The main goal of this paper will be to test if the stellar 
mass measurements at high redshift are reliable. 

The paper is organized as follows. In $2] we present our 
sample of high-redshift galaxies, the photometric and spec- 
troscopic data, and describe our data reduction. In |3]we de- 
termine structural properties and stellar populations, and de- 
rive stellar and dynamical masses. We complement our re- 
sults with stellar kinematic results from other studies at low 
and high redshift in ^4] In §|5] we compare our dynamical to 
the stellar masses. In §|6]we study the structural evolution of 
high-redshift quiescent massive galaxies. In §|7] we compare 
our results with previous measurements and hydrodynamical 
simulations. Finally, in |8]we summarize our results and con- 
clusions. Throughout the paper we assume a ACDM cosmol- 
ogy with Q m =0.3, Q A = 0.7, and H Q = 70 km s _1 Mpc -1 . All 
broadband data are given in the AB-based photometric sys- 
tem. 

2. DATA 

2.1. Target Selection 

The galaxies in th is paper ar e drawn from the N MBS-I 
dWhitaker et alj|20!ob and UDS dWilliams et alj|2009b . They 
were selected to be bright in the //-band, and to have z > 1 .4, 
in order to obtain sufficient S/N in 2 hours. The SED from 
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Fig. 2. — Color Images of our five spectroscopic targets. Except for UDS-19627, all galaxies have available HST-WFC3 Imaging. For each target we show the 
composite color image on the left side, the best Sersic model from GALFIT and the residual after we subtract the best-fitting model from the original image on 
the right side. The lower panel shows the intrinsic surface brightness profile with all available bands. Different colors show the different filters, as indicated on 
the bottom right. Vertical dashed lines show the effective radii for each profile, while the dotted lines shows the FWHM/2 of the PSF. We find color gradients, 
such that the redder bands have smaller effective radii, for all galaxies but NMBS-COS7447. In that case, the sizes are similar within the errors, but this could be 
caused by the extra flux of the red arc-like feature in the South-East. 

the broad and medium-band photometry was required to indi- 
cate that they have quiescent stellar populations, and the rest- 
frame optical imaging could not show signs of large distur- 
bance due to e. g. mergers. We note that NMBS-COS7447 
is presented in Ivan de Sande et alJ (120111) . and UDS-19627 
is presented in Toft et al.l (120121) . All data for both galaxies 
have been re-analyzed according to the following procedure 
for consistency. Our selection had no priors on mass or size, 
but could be biased in either one of these parameters. 

To investigate this, we compare our targets to a sample of 
galaxies with mass > 10 I0 ' 5 M o at 1.4 < z < 2.1 from the 
NMBS-I and UDS. Rest-frame U - V and V - J colors are 
often used to distinguish betw een star-forming and quiescent 
galaxies at this redshift (e.g. Williams et al. 2009). 



Figure 

[TJ; shows the UVJ-diagram for all galaxies at redshifts be- 
tween 1.4 < z < 2.1 with mass > 1O 1(X5 M , together with 
the sample presented here. The size of the squares indi- 
cates the density of galaxies. For our targets, the rest-frame 
colors have been measured from the spectra and best-fitting 
models, while for the full sample rest-frame colors are based 
on the broad and medium-band data. As demonstrated by 



IWilliams et al.l (120091) . non-star-forming galaxies can be se- 
lected using a color selection indicated by the black lines. 
Within this selection region, our tar gets fall in the region occu- 
pied by young, quiescent galaxies ( Whitaker et al. 2012J). The 
median specific star formation rates (sSFR), as indicated by 
the different colors, are in good agreement with the full high- 
redshift sample at the same place in the UV-VJ diagram. For 
their mass, however, NMBS-COS7447 and UDS-19627 have 
slightly bluer colors as compared to the full sample (Figure 
Q}?). At fixed mass, the targets are among the brightest galax- 
ies, except for NMBS-I-7865 (QJ). This may not come as a 
surprise as they are among the youngest quiescent galaxies, 
and thus have relatively low M/L. 

2.2. Spectroscopic Observations 

Obs ervations were perform ed with X-Shooter on the VLT 
UT2 dD'Odorico et al.|[2.006h . X-Shooter is a second gener- 
ation instrument on the VLT that consists of 3 arms: UVB, 
VIS, and NIR. The wavelength coverage ranges from 3000 to 
24800 A in one single exposure. The galaxies were observed 
in both visitor and service mode, and the observations were 
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Fig. 3. — UV to NIR X-Shooter spectra in combination with medium- and broad-band data (blue diamonds). The binned spectra (~ 10A) are shown in black, 
together with the best-fitting BC03 r-model as shown in red. Grey areas indicate regions with strong sky emission or atmospheric absorption. The UVB spectrum 
is missing for UDS-29410, due to an instrument problem during the observations. The good agreement between the BC03 models and the spectroscopic data over 
this large wavelength range is uncanny, and shows the power of modern spectrographs. Both NMBS-COS7865 and UDS-19627 show a small deviation from the 
best-fitting model around l/.fm, which is caused by the absence of good telluric calibrators. From stellar population synthesis modeling, we find a variety of ages 
that range from 0.5-2 Gyr. We find no emission lines, and other signs of star formation, and with very little to no dust, these galaxies are quiescent (see Section 
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Fig. 4. — Rest-frame optical part of the spectrum focused on the Balmer break. As in Figure[3] the X-Shooter spectrum is shown in black, but this time in higher 
resolution (~ 4A observed, or ~ 100 km s rest-frame). The most prominent absorption features are indicated by the blue dashed lines. The clear detection of 
absorption lines enables us to measure stellar velocity dispersions. We use pPXF to fit the best-fitting BC03 r model to the spectrum and find velocity dispersions 
that range from 275-435 km s . The convolved best-fit BC03 template is shown in red. 
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TABLE 2 

Stellar Population Synthesis Properties 



Catalog ID z phot z spec logr log Age Z A v logM, log SFR log sSFR 

(yr) (yr) (mag) (M G ) (Moyr 1 ) (yr 1 ) 



NMBS-COS 7447 1.71+0.03 1.800 7.80 

NMBS-COS 18265 1.60±0.03 1.583 7.00 

NMBS-COS 7865 2.02±0.05 2.091 7.20 

UDS 19627 1.94±0.06 2.036 7.90 

UDS 29410 1.44±0.02 1.456 8.00 
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0.020 
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0.050 


0.15 
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0.05 
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Note. — No errors are provided, as the 68% confidence all fall within one grid point. The real errors are dominated by systematic 
uncertainties. 



carried out between January 2010 and March 201 1 (Programs: 
Fynbo 084.A-0303(D), Van de Sande 084.A-1082(A), Franx 
085.A-0962(A), Toft 086.B-0955(A) ). Full information on 
the targets and observations are listed in TableQ] All observa- 
tions had clear sky conditions and an average seeing of 0.8". 
A 0.9" slit was used in the NIR, except for the 1st hour of 
UDS-19627 where a 0.6" slit was used. For the 0.9" slit, this 
resulted in a spectral resolution of 5100 at 1.4/rni. Observ- 
ing blocks were split in exposures of each 10 to 15 minutes 
with an ABA'B' on-source dither pattern. For most targets, a 
telluric standard of type B8V-B9V was observed before and 
after our primary target, in order to create a telluric absorption 
spectrum at the same airmass as the observation of our target. 

2.3. Spectroscopic Reduction 

Data from the three arms of X-Shooter must be analyzed 
separately and then combined to cover the full range from the 
UV-NIR. In the NIR we identified bad pixels in the following 
way. The data were corrected for dark current, flatfielded, and 
sky subtracted using the average of the precedin g and subse- 
quent frames. The ESO pipeline (version 1.3.7. lGoldoni et all 
2006) was used to derive a wavelength solution for all orders. 
The orders were then straightened using integer pixel shifts to 
retain the pixels affected by cosmic -rays and bad pixels. Ad- 
ditional sky subtraction was done on the rectified orders, by 
subtracting the median in the spatial direction. Cosmic rays 
and bad pixels were identified by LA-Cosmic (van Dokkum 
2001), and a bad pixel mask was created. 

Further 3-sigma clipping was done on the different expo- 
sures, corrected for dithers, to identify any remaining outliers. 
The bad pixel masks of different orders were combined into a 
single file and then transformed back to the raw frame for each 
exposure. In the following, the mask file will follow the same 
rectification and wavelength calibration steps as the science 
frames. 

The raw science frames were corrected for dark current, 
flatfielded, and sky subtracted using the average of the pre- 
ceding and subsequent frames. Next the observations were 
rectified and wavelength calibrated, only this time we used 
interpolation when rectifying the different orders. Again, ad- 
ditional sky subtraction was done. Per order, all exposures 
were combined, with exclusion of bad pixels and those con- 
taminated with cosmic rays present in the mask file. 

The telluric spectra were reduced in the same way as the 
science frames. We constructed a response spectrum from 
the telluric stars in combination with a stellar model for a 
B8V/B9V star from a black body curve and models from 
Muna rTet al.l (|2005). Residuals from Balmer absorption fea- 
tures in the spectrum of the tellurics were removed by inter- 
polation. All the orders of the science observations were cor- 



rected for instrumental response and atmospheric absorption 
by dividing by the response spectrum. 

The different orders were then combined, and in regions 
of overlap weighted using the S/N of the galaxy spectrum. 
A noise spectrum was created by measuring the noise in the 
spatial direction below and above the galaxy. If the regions 
exceeded an acceptable noise limit, from contamination by 
OH lines or due to low atmospheric transmission, this spatial 
region was discarded for further use. The 2D spectra were 
visually inspected for emission lines, but none were found. 
A ID spectrum was extracted by adding all lines (along the 
wavelength direction), with flux greater than 0.1 times the 
flux in the central row, using optimal weighting. Absolute 
flux calibration was performed by scaling the spectrum to the 
available photometric data. 

A low resolution spectrum was constructed by binning the 
2D spectrum in wavelength direction. Using a bi-weight 
mean, 20 good pixels, i.e. not affected by skylines or strong 
atmospheric absorption, were combined. The ID spectrum 
was extracted from this binned 2D spectrum in a similar fash- 
ion as the high resolution spectrum (see Figure\3]&$\). 

For the UVB and VIS arms w e used the ESO pipeline (ver- 
sion 1.3.7, iGoldoni et al.ll200"6l) to correct for the bias, flat- 
field, and dark current, and to derive the wavelength solution. 
The science frames were also rectified using the pipeline, but 
thereafter, treated in exactly the same as the rectified 2D spec- 
tra of the NIR arm, as described above. 

2.4. Imaging 

Four different imaging datasets are used to measure the 
surface brightness profiles of our galaxies, as summarized 
below. (1) All our target in the NMBS-I field were ob- 
served with HST-WFC3 Z/ieo as part of the program HST- 
GO-12167 (PI: Franx). Each targets was observed for one 
orbit (2611 sec), using a four point dither pattern, with half 
pixel offsets. Reduction o f the data was done i n a similar 
way as to that described in iBouwens et al.l {2010), but with- 
out sigma-clipping in order to avoid masking the centers of 
stars. The drizzled images have a pixel scale of 0.06", with 
a full width at half-maximum (FWHM) of the point-spread 
function (PSF) of ~ 0.16". (2) Our NMBS-I targets are 
compl emented with HST-ACS hu imaging from COSMOS 
(v2.0, lKoekemoer et al.l l2007: Mas sev et alfcoiOl) , which has 
a 0.03" pixel scale and PSF-FWHM of - 0.11". (3)ForUDS- 
29410 we make use of the HST-ACS F814W, HST-WFC3 
Jns and H\ m from U DS-CANDELS dGrogin et al.l 1201 lh 
Koekemoer et al. 201 1). These data have the same properties 
as the data described in (1) and (2). (4) For UDS-19627 we 
use ground based data fro m UKIDSS-UDS, dLawrence et al.l 
120071: IWarren et al.l l2007) Data Release 8 in the J, H, and K- 
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TABLE 3 

Compilation of masses and structural parameters for high-redshift galaxies 



Reference 11 ID Zspec r e ^sersic 

b/a 

7447 1.800 1.75 ±0.21 5.27 ± 0.23 

18265 1.583 0.97 ± 0.12 2.97 ± 0.06 

7865 2.091 2.65 ± 0.33 4.82 ±0.15 

19627 2.036 1.32 ±0.17 3.61 ±0.73 

29410 1.456 1.83 ± 0.23 2.59 ± 0.03 



cr e a- e /cr ap f}(n) \ogM dyn \ogMl corr Filter 



0.71 ±0.02 


287 + | 


1.048 


5.16 


0.26 ±0.02 


400+?* 


1.065 


6.61 


0.83 ±0.02 


446+5* 
304+J 


1.031 


5.42 


0.48 ±0.06 


1.059 


6.18 


0.54 ±0.02 


371+1' 4 

•" 1 -90 


1.045 


6.88 



ll-24+||j^ 11.22 #F160W 

11.38+j,,. 11.32 #F160W 

11.82+™ 11.64 H F160W 

1 1.24+0 1° U2 o k 

11.6l2 : }! 11-24 tf F160W 



Note. — This Table will be published in its entirety in the electronic edition of ApJ, and can also be downloaded from 
http: //www. strw.leidenuniv.nl/~sande/data/ A portion is shown here for guidance regarding its form and content. 

a References: 0) Thi s Work 1) Bezanson et al. (2012); 2) van DokkuinetaT](2003); 3) IOnodera etal] Um% ; 4) ICappellari"etaj] <2009l) ; 5) INewman etal] <20l0l) ; 
6) van der Wei et al. ( 2008) & Blakeslee et al. 1 2006) 7) Toft et al. 1 2012). 

k Stellar masses are corrected to account for the difference between the catalog magnitude and our measured magnitude. 



band, as no as HST data is available. Imaging from all three 
bands were drizzled to a pixel scale 0.134", and the FWHM 
of the PSF is 0.7" in the K-band. 

3 . STRUCTURAL PROPERTIES AND STELLAR POPULATIONS 

3.1. Surface Brightness Profiles 

Radial profiles are measured for all gal axies on all avail- 
able imaging as described in Section 12.41 Galaxies are fit- 
ted by two-dimensional Seisic radial surface brig htness pro- 
files ((SersidI 19681) . using GALFIT (version 3.0.2: iPeng etaTl 
120101) . Relatively large cutouts of 25" x 25" were provided 
to GALFIT to ensure an accurate measurement of the back- 
ground, which was a free parameter in the fit. All neighboring 
sources wer e masked using a segme ntation map obtained with 
SExtractor dBertin & Arnoutslll996l) . In the case of UDS- 
19627, the close neighbor was fitted simultaneously. Bright 
unsaturated field stars were used for the PSF convolution. All 
parameters, including the sky, were left free for GALFIT to 
determine. 

Even though galaxies at low redshift are we ll-fitted by sin- 
gle Sersic profiles (e.g. iKormendv et al.ll2009l) . this does not 
necessarily have to be true for galaxies at z ~ 2. Therefore, 
we correct for missin g flux using the method described in 
ISzomoru et al.l |2010). We find very small deviation in resid- 
ual corrected effective radii, with a median absolute deviation 
of 3.4%. Color images and measured profiles are shown in 
Figure |2] 

We repeated the measurements using a variety of PSF stars 
(N ~ 25). We find an absolute median deviation in the half- 
light radius of ~ 3% for HST-WFC3, ~ 3.5% for HST-ACS, 
and ~ 10% for the ground-based UDS-UKIDSS data, due to 
variations in the PSF. The largest source of uncertainty in the 
measured profiles is, however, caused by the error in the sky 
background estimate. Even though these galaxies are among 
the brightest at this redshift, using the wrong sky value can 
result in large errors for both r e and n. We determine the error 
in the sky background estimate by measuring the variations of 
the residual flux in the profile between 5 and 15 arcseconds. 
For sizes derived from HST-ACS, the absolute median devi- 
ation in the effective radius due to the uncertainty in back- 
ground is ~ 13%, and for HST-WFC3 ~ 12%. Due to the 
deeper ground-based UDS-UKIDSS data, the uncertainty for 
UDS-19627 due to the sky is ~ 8%. All of our results are 
summarized in Table|3] In what follows, we will use the mean 
effective radius and Sersic n from the band which is closest to 
rest-frame optical r' . 



3.2. Stellar population properties 

We estimate the stellar population properties by fitting the 
low-resolution (~ 10A in observed frame) spectrum in the 
Visual and NIR in combination with the broad- and medium- 
band photometry with SPS models. We exclude the UVB part 
of the spectrum due to the lower S/N and the extensive high 
S/N broad band pho tometry in this wavelen gth region. Stel- 
lar templates from lBruzual & Charlotj|2003l (BC03) are used, 
with an exponentially declini ng star formati on history with 
timescale t, t ogethe r with a IChabrierl (120031) IMF, and the 
iCalzetti et all (|2000l) redd ening law. 

Using the FAST code (iKriek et al.ll2009l) we fit a full grid 
in age, dust content, star formation timescale, and metallicity. 
We adopt a grid for t between 10 Myr and 1 Gyr in steps 
op 0.1 dex. The age range can vary between 0.1 Gyr and 
10 Gyr, but the maximum age is constrained to the age of 
the Universe at that particular redshift. Step size in age is 
set as high as the BC03 templates allow, typically 0.01 dex. 
Metallicity can vary between Z = 0.004 (subsolar), Z = 0.08, 
Z = 0.02(solar), and Z = 0.05 (supersolar). Furthermore, we 
allow dust attenuation to range from to 2 magnitudes with 
step size of 0.05. The redshift us ed h ere is from the best-fitting 
velocity dispersion (see Section [3~3~l l. Results are summarized 
in Table |2] 

Due to our discrete grid and the high quality data, and also 
because metallicity and age are limited by the BC03 models, 
our 68% confidence levels are all within one grid point. Our 
formal errors are therefore mostly zero, and not shown in Ta- 
ble |2] This does not reflect the true uncertainties, which are 
dominated by ch oice of SPS models, IMF, SFH, and ex tinc- 
tion law (see e.g. IConrov et al.ll2009tlMuzzin et al.ll2009l) . 

The low sSFR confirms the quiescent nature of the galaxies 
in our sample, and they match well with the sSFR of the gen- 
eral population in the same region of the UVJ diagram (Figure 
QJ). We find a range of metallicities, with the oldest galaxy 
having the lowest metallicity. However, due to the strong de- 
pendency between age and metallicity, we do not believe this 
result to be significant. Overall, the dust content in our galax- 
ies is low. 

The galaxies in our sample are not detected at 24 pm, lead- 
ing to a 3-cr upper-limit of 18 pJy for t he galaxies in N MBS- 
COSMOS and 30 pJy in the UDS (see Whi taker et al.ll2012t 
iToft et all 120 121) . From these upper limits we calculate the 
dust-enshrouded SFRs which are listed in Table Q] 
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Fig. 5. — Comparison of the stellar mass versus the dynamical mass. Grey 
squares are again non-star-forming galaxies from the SDSS. Different sym- 
bols are from a compilation of high-redshift galaxies as described in Section 
14.21 The dashed line is for equal dynamical and stellar mass. Low-redshift 
galaxies are all below the line, as is expected from the contribution of dark 
matter. All our high-redshift galaxies have dynamical masses that are equal or 
larger than the stellar mass. This suggests that the stellar mass measurements 
at high-redshift are robust for passive galaxies. 

3.3. Velocity dispersions 

The clear detection of absorption lines in our spectra, to- 
gether with the medium resolution of X-Shooter, allow us 
to measure accurate stellar velocity dispersions. We use the 
unbinned spectra in com bination with the Penal i zed Pixel- 
Fitting method (pPXF) bv lCappellari & Emselleml d2004l) and 
our best-fitting BC03 models as templates. Spectra were put 
on to a logarithmic wavelength scale without using interpo- 
lation, but with masking of the bad pixels. The effect of 
template mismatch was reduced by simultaneously fitting the 
template with a ~17-order Legendre Polynomial. Our results 
depend only slightly on the choice of the order of the poly- 
nomial (Appendix lAb. Together with the measured velocity 
dispersion, the fit also gives us the line of sight velocity, and 
thusz S pec. 

In particular for the younger galaxies in our sample that 
show a clear signature of A-type stars, we find a dependence 
of the measured velocity dispersion as a function of template 
age. A more stable fit is obtained when restricting the wave- 
length range to 4020A < A < 7000A, which excludes the 
Balmer break region (see also AppendixlAl. 

The errors on the velocity dispersion were determined in 
the following way. We subtracted the best-fit model from the 
spectrum. The residuals were randomly rearranged in wave- 
length space and added to the best-fit template. We deter- 
mined the velocity dispersion of 500 simulated spectra. Our 
quoted error is the standard deviation of the resulting distribu- 
tion of the measured velocity dispersions. When we include 
the Balmer break region in the fit, the formal random error de- 
creases, but the derived dispersion becomes more dependent 
on the chosen stellar template. In total we have three high- 
quality measurements, and two with medium quality. We 
note that if we exclude the two medium-quality measurements 
from our sample, our science results would not change. 

All dispersions are corrected for instrumental resolution 
(<r=23 km s _1 ) and the spectral resolution of the templates 



(<r=85 km s _1 ). Furthermore, we apply an aperture correc- 
tion to our measurements as if they were observed within a 
circular aperture radius of r e . In addition to the traditional 
corr ection for the radial d ependence of velocity dispersion 
(e.g. ICappellari et al.ll2006l) . we account for the effects of the 
non-circular aperture, seeing and optimal extraction of the 1 - 
D spectrum. The aperture corrections are small with a median 
of 4.8% (See Appendix IB!. The final dispersions and corre- 
sponding uncertainties are given in Table[3] 

3.4. Dynamical Masses 

Combining the size and velocity dispersion measurements 
we are now able to estimate dynamical masses using the fol- 
lowing expression: 



M dya = 



f3(n) cr e r e 



(1) 



Here /?(«) is an analyti c expression as a funct ion of the Sersic 
index, as described by Cappellari et al. ( 2006): 



/?(«) = 8.87 - 0.83 In + 0.0241n 2 



(2) 



This is computed from theoretical predictions for [5 from 
spherical isotropic models described by the Sersic profile, for 
differ ent values of n, and integrated to one r e (cf. Berl in et al.l 
2002). Using a Sersic depended virial constant j3(n) gives a 
better corr espondence between Md yn and M, for galaxies in 
the SDSS dTavlor et alj|2010bl) . This does require however, 
that the total stellar masses are also derived using the lumi- 
nosity of the derived Sersic profile. Thus we correct our total 
stellar mass, as derived from the total magnitude as given in 
the catalogs (measured with Sextractor), to the total magni- 
tude from the Sersic fit. We note that the values for /3 that we 
find are all close to 5, a value often used in the literature (e.g. 
Capp ellari et alj|2006l) . Our dynamical masses and corrected 
stellar masses are given in Table[3] 

4. COMPILATION OF KINEMATIC STUDIES 

Before we are able to study the structural evolution of our 
galaxies, we first combine the results from different kinematic 
studies at various redshifts. Where possible, we apply similar 
corrections as described above. 

4.1. Low-redshift sample 

At low redshift we select galaxies from the Sloan Digi- 
tal Sky Survey (SDSS, DR7). Stellar masses are based on 
MRAB fits to the phot omet ry following the p hilosophy of 
Kauff mann et all d2003l) andlSalim et alj (120071). Star forma- 
tion rates are based on iBrinchmann et all d2004f) . Structural 
parameters are from the NYU Val ue-Added Galaxy Catalog 
(NYU-VAGC. iBlanton etall 120051) . For all galaxies, veloc- 
ity dispe rsions were aperture corrected as describe d in Sec- 
tion 13.31 and stellar masses are calculated with a Chabrier 
(2003) IMF. We furthermore correct the stellar masses using 
the total magnitude from the best Sersic fit. All dynamical 
masses were derived using Equation Q] For making an accu- 
rate comparison between low- and high-redshift galaxies, we 
only select non-star-f orming galaxies, i.e sSFR < 0.3/f# (see 
IWiiliams et al.l 12009). where f# is the age of the universe at 
the given redshift. 

8 http://www.mpa-garching.mpg.de/SDSS/DR7/ 
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Fig. 6. — Left panel: Stellar mass divided by the dynamical mass versus redshift. Galaxies below the line have dynamical mass greater than the stellar mass, 
above the line is the non-physical regime. For SDSS galaxies with stellar mass > 10"M Q , we find that they have a median M t /Mjy n of 0.59. Up to redshift 
z = 1.5 we find a similar slightly lower median value (~ 0.5), but it rapidly increases at z > 1.5 with median a of M t /Mjy„ = 0.9. The solid line is the best-fit 



M,/M dm °c(l+ z )' 



0.17±0.04 



. We caution that this result might be biased due to the selection effects as explained in Section fj] and relatively large measurement 



errors. Right panel: Stellar mass divided by the dynamical mass versus the evoltion in the effective radius at fixed dynamical mass. Galaxies that have small 
effective radii at fixed dynamical mass also show higher ratios of M, /Mjy,, , although there is a significant scatter. 



4.2. Intermediate- and high-redshift sample 

Our high redshift sample consists of a collection of both 
optica l and NIR spectroscopi c studies on non-stacked sam- 
ples. Ivan der Wei et al.l (|2008) present a sample of quiescent 
galaxies at z ~ 1, which itself is a compilation of three studies 
in the following fields: Chan dra Deep Field South (CDF-S; 
Ivan der Wei eTal]l2004t l200l. the Hubble Deep Field North 
(HDF-N: lTreuetal.ll2005at l2005bl). and cluster g alaxies in 
MS 1054-0321 atz = 0.831 dWuvtsetal] l2004f . We de- 
rive stellar masses for this sample by running FAST o n avail- 
able cat alogs, i.e., FIREWORK S dWuvts et all 120081) for the 
CDS-S. ISkelton et al. I din prep.1) for the HDF-N, and FIRES 
(HEier Schreiberetal]|2P06) for MS 1054-0321. For CDF- 
S and HDF-N the stellar masses are corrected using the total 
magnitude from the best n - 4 fit to be consisten t with the 
structural parameters from lvan der Wei et al.l (120081) . For MS 
1054-0321, we use structural par ameters and stel l ar ma ss cor- 
rections based on the results by Blakes lee et al.1 (120061) . who 
fit Sersic profiles with n as a free parameter. 

Other high - redshi ft re sults included here are from 
iNewman et al.l (120101) . and Bezan son et al.l (120121) . who use 
the new red-arm of LRIS on Keck to obtain UV rest- 
frame spectra of galaxies at z ~ 1.3 and z ~ 1.5 respec- 
tively. Velocity dispersions for t wo ga laxies at z - 141 
are presented by Cappell ari et al.l (120091) . and have been ob- 
serve d with VLT-FORS2 (but see also iCenarro & T ruiillo 
120091) . Using NIR sp ectrographs. lOnodera et al.l d2012l 
Subaru-MOIRCS), and Ivan Dokkum et al.l (120091 GNIRS) 
obtained velocity dispersions for two gal axies at z — 1.82 
and z = 2.186. Similar to this study, iToft et all (120121) 
study UDS-19627 using VLT X-Shooter. Dynamical masses 
were deriv ed using to Eq u ation [D Note that for th e 
studies of ICappellari et al.l (l2009l)Honodera et al l d2012l) . 
Ivan Dokkum et al.L and lToft et al.l (120121) no stellar mass cor- 
rections were applied due to the absence of the necessary 
information. All structural and kinematic properties of our 
high-redshift sample are listed in Table [3] 



5. ARE STELLAR MASSES RELIABLE? 

The main goal of this paper is to see whether the stellar 
masses at z ~ 2 are reliable. Here we compare our stellar 
masses, as derived from the spectra and photometry, to our 
dynamical masses, which are derived from effective radii and 
stellar velocity dispersions (Figure|5]l. Grey squares represent 
the density of non-star-forming, low-redshift galaxies from 
the SDSS as described in Section |4T| Other symbols are the 
high-redshift studies as described in Section l4~2l The one-to- 
one relation for Md yn and M„ is indicated by the dashed line. 
Note that the region above the line is nonphysical with stellar 
masses being higher than the dynamical mass. 

We find that most z > 1 .5 galaxies are very massive, in the 
range 1 1.2 < log M^„/M < 1 1.8. At all redshifts, stellar and 
dynamical masses are tightly correlated and dynamical mass, 
which includes baryonic and dark matter, is always higher 
than stellar mass. Thus, we infer that the stellar masses are 
broadly correct, and that the apparent size evolution of mas- 
sive galaxies in photometric studies canno t be explained by 
errors in the photometric masses (see also Ivan der Wei et al.l 
l2008h . 

Figure [6^ shows the ratio of the stellar and dynamical 
mass as function of redshift for all galaxies with stellar mass 
> 10 n M o . We see that the average ratio at low-redshift for 
massive galaxies is a factor of 0.59 with a scatter of 0. 12 dex. 
We note that For MS 1054-0321, the ratio of the stellar to dy- 
namical mass are slightly higher as compared to low redshift 
galaxies. Up to redshift z ~ 1.5 we find a similar value (~ 0.5) 
with similar scatter, but at higher redshift, the ratio seems to 
decline. For galaxies at z > 1.5 we find a median ratio of 
Mt/Mdyn = 0.9. We quantify the evolution in this ratio by 
fitting the relation: 



MJM dyn K(l+z) a . 



(3) 



We find a = 0.17 ± 0.04 (linear fit in log-log space), shown 
as the solid black line in Figure^. Due to the relatively large 
measurements errors as compared to low redshift, and the se- 
lection bias, we are cautious to draw any strong conclusions 
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those of Bezans on et alj (120121) (cyan circles) are in general 
more compact as compared to the high-redshift CANDELS 
galaxies (small green circles). 

For low-redshift galaxies we parametrize the mass-size re- 
lation by: 



Fig. 7. — Structural comparison of our spectroscopic sample to the full population. Left panel: Effective radii versus mass for low- and high-redshift galaxies. 
Grey squares are non- star-forming z ~ galaxies from the SDSS, with the dashed line the best-fit to Equationf4] High-redshift quiesce nt galaxies (1.4 < z < 2.1) 
from CANDELS are shown as green circles, together with our spectroscopic sample shown as red circles, and the sample by Bezanson et al. ( 2012) shown as 
cyan circles. At fixed mass, our high-redshift galaxies have smaller effective radii, similar to what has been found by many studies. Middle panel: Evolution in 
effective radius at fixed stellar mass, thus corrected for the M. - r e relation from the left panel. Big green squares are the median effective radii in bins of redshift 
for the CANDELS data. The solid line is the best r e oc (1 + z )-l-<K±0.05 At similar re dshift, we find that our sample and that of Bezanson et al. 1 2012) is mostly 
below this fit, indicating that our samples our biased towards smaller effective radii. Right panel: Similar to the middle panel, but now divided by (1 + z)~' 02 
for a better comparison of our spectroscopic sample to the CANDELS data. When comparing the median of our sample to the binned median of other quiescent 
galaxies at similar redshift, we find smaller effective radii by a factor ~ 1.17. This might be be explained by us using a selection based on aperture magnitude, 
which tends to be biased towards smaller galaxies. 

from this. Another way of thinking about this is that the ve- 
locity dispersions inferred from the stellar mass and effective 
radii are higher than the m easured velocity dispersions (see 
also iBezanson et aT] 120121) . It is tempting to speculate that 
this might be caused by a decrease in the dark matter fraction 
as a function of redshift. If galaxies grow in size over time, 
then the dark matter fraction within r ? will also increase i n 
a similar fashion as shown here (e.g. Hop kins et alj|200"93) . 
If so, this could also indicate that the IMF at high-redshift is 
very similar to the IMF at low-redshift. 

Figure^ shows M,/M</ v „ versus the evolution of the effec- 
tive radius at fixed dynamical dynamical mass (see Section 
I6.2l and Figure[8]l. Although there is significant scatter, we do 
find the galaxies with high M„IM<j sn also tend to have small- 
est size at fixed dynamical mass. Galaxies that are closest to 
the present-day mass-size relation (dashed vertical line) show 
lower ratios of stellar to dynamical mass. This agrees with the 
idea that the decreasing ratio of M t /M^ vn with time might be 
correlated to the size growth of massive galaxies. 



10 n M o 



(4) 



6. STRUCTURAL EVOLUTION OF QUIESCENT GALAXIES 

From the previous section we now know that the dynami- 
cal masses are consistent with the stellar masses. However, 
for the high-redshift galaxies in our sample, the dynamical 
masses are somewhat lower than expected from the stellar 
masses. For this reason, we will re-examine the structural evo- 
lution of massive quiescent galaxies but now using dynamical 
measurements. 

6.1. Bias towards compact galaxies 

As noted in Section 12.11 this sample is biased towards 
young quiescent galaxies. Theref ore, we will first inves ti- 
gate whether our sample and that of Bezans on et al] d2012l) is 
also biased in size as compared to other high-redshift galaxies. 
We gathered structural properties of galaxies from two stud- 
ies tha t use CANDELS data in the UPS and GOODS-South 
fields (Patel et al. 2012; Szomoru et al. 2012). We compare to 
a subsample of these galaxies which are determined to be qui- 
escent from their rest-frame U-V and V-J colors (see e.g. Fig- 
ureQJ). When comparing the effective radii versus the stellar 
mass in Figure ITJj, we find that our galaxies (red circles) and 



dShen et al.1 12001 Ivan der Wei etafl l2Q08h . Using a linear 
least-squares fit in log-log space, we find best fitting values 
of r c — 4.32 kpc a n d b = 0.62. This is slightly different 
from the IShen et aT] d2003l) results who find r c = 4.16 kpc 
and b = 0.56. The difference may be explained by different 
selection criteria, and their use of an older release version of 
SDSS. Figure [7J? shows the evolution in effective radius, by 
comparing galaxies with similar mass at different redshifts. 
Using both the SDSS and the CANDELS data, we examine 
the amount of evolution in size by fitting the following rela- 
tion: 

r e K(l+z) a , (5) 

We find a = -1.02 + 0.05 (linea r fit in log-log space). Our 
spectroscopic targets and those of Bez anson et al.l d2012l) are 
mostly below this best-fit relation, being smaller by a factor 
of ~ 1.17 as compared to median in redshift bins (big green 
squares). This is especially clear from Figure [Tj?, where we 
correct for the evolution in size. 

The explanation for this bias might be related to the way our 
targets are selected. As our selection is based on the magni- 
tude within a fixed aperture of 1.5", instead of the total mag- 
nitude, we create a bias towards compact galaxies. For galax- 
ies with similar total magnitudes, the smaller galaxies will be 
brighter within a photometric aperture, which will increase 
the S/N of their spectra, and thus make it into our sample. In 
what follows, we will correct for t his bi as by increasing our 
sizes and those of Bezanson et al. (2012) by a factor of 1.17, 
and decreasing the velocity dispersion by a factor of Vl-17. 

6.2. Evolution in Size 

In Figure^ we plot effective radius versus dynamical mass. 
Symbols are the same as in Figure [6] For the low-redshift 
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Fig. 8. — Redshift evolution of the effective radius in passive galaxies. Left panel: Size versus dynamical mass. Dashed line is the best-fit using Equation|6]for 
non-star-forming SDSS galaxies. At fixed dynamical mass, we find that our galaxies are smaller by a factor ~ 3 compared to low-reshift galaxies. Middle panel: 
Evolution of the effective radius at fixed dynamical mass versus redshift. The solid line is the best-fit r e oc (1 + z)~ ^, similar to what has been found by 
other kinematic studies at high-redshift. Similar to Figure|7] our sample is below the best-fit line to the entire high-redshift sample. Right panel: Evolution of the 
effective radius using an evolving mass function at constant number density. We now compare galaxies at high-redshift, to more massive galaxies at low-redshift, 
assuming the mass evolves as A log M/M Q ~ 0. 15z. This time, we find an even stronger evolution, with r e oc (1 + z )-l-32±0.l 



galaxies we parametrize the mass-size relation according to 
the following equation 



yn 



10 U M F 



(6) 



and find r c = 3.23 kpc and b = 0.56 (dashed line). This is 
in good agreement with b - 0.56 and r c - 3.26 kpc as found 
by Ivan der Wei et aTl (120081) . At fixed dynamical mass, we see 
that all our galaxies have smaller effective radii as compared 
to low-redshift. This is further illustrated in Figure [SJ), where 
we compare the effective radii at fixed dynamical mass to the 
mass-size relation at z ~ 0. The solid line is the best-fit as de- 
scribed by equation|5] with a = -0.99 ±0.1. This is in agree- 
ment to what has been found in previous kin ematical studies 
( Ivan der Wei et al.1 120081: iNewman etafl 120 1 Ol) . The scatter 
between different s tudies is considerable, with the work by 
Ivan Dokkum et al] d2009l) h aving the largest size difference 
while lOnodera et al.1 (120 12T) has the smallest. This sample 
falls in between the two ex trem es, but we note that corrected 
for the bias in size (Section loTTT i. 

Instead of comparing galaxies sizes at fixed dynamical 
mass, we wil l now take into a cco unt that galaxies do evolve 
in mass (e.g. iPatel et al.ll2012l) . In Ivan Dokkum etal.1 (120101) 
they find that, for a sample selected at a constant number den- 
sity, the stellar mass evolves as 



logMJMo = 1 1.45- 0.1 5z. 



(7) 



The number density that this result is based upon, n — 2 x 
10~ 4 Mpc~ 3 , corresponds to an average mass of log M»/M ~ 
11.15 at z ~ 2, similar to our sample. Assuming that the 
mass evolves as A log M/M G ~ 0.15z, we will now use mass 
evolution to compare effective radii for galaxies at different 
redshifts. For example, a galaxy with log M,/ v „/M = 11 
at z ~ 2 will thus be compared with a z ~ galaxy with 
logMd y „/M G = 11.3. Figure [SJ; shows the evolution in size 
at fixed number density as function of redshift. Not surpris- 
ing, the evolution in effective radii is more extreme, as we are 
now comparing z ~ 2 galaxies to more massive, and therefore 
bigger galaxies at z ~ 0. Again using Equation [5] we now 
find that a = -1.32 ± 0.1 provides the best fit. In conclusion, 
assuming that galaxies evolve in both mass and size, we find 
that the effective radii have to grow by a factor ~ 4 from z ~ 2 



to the present day. 

6.3. Evolution in Velocity Dispersion 

In Figure [9^ we compare the stellar velocity dispersion 
within one r e versus the dynamical mass for both low- and 
high-redshift samples. The dashed line is the parametrization 
of the cr e - Md yn relation for low-z galaxies using the follow- 
ing equation: 



o- c 



M dy 



10 n M c 



(8) 



We find that o> = 148.9 km s 1 and b = 0.24. Our high- 
redshift sample is clearly offset from low-redshift galaxies in 
the SDSS, i.e. at fixed mass they have higher velocity disper- 
sions. Comparison of the velocity dispersion at fixed dynam- 
ical mass, as seen in [9}3, shows a clear evolution in cr e , such 
that velocity dispersion decreases over time. From this fig- 
ure the increase in accuracy for the velocity dispersion mea- 
surements with new instruments, as compared to other older 
studies, is also clearly noticeable. Again we use the following 
simple relation to quantify the amount of evolution: 



cr e cc(l +zf, 



(9) 



and find that a = 0.50 + 0.08. From z ~ 2 to z ~ the stellar 
velocity dispersions decrease by a factor ~ 1.7. Again, we 
note that we applied a correction to the velocity dispersions 
in our sample, in order to cor rect for the bias towards more 
compact galaxies (Section loTTI ). If we now compare low- and 
high-redshift galaxies using an evolving mass function as de- 
scribed above, we find that the velocity dispersion decreases 
less than when comparing at fixed dynamical mass. If we use 
equation|9] we find that a = 0.36 ± 0.08. In other words, the 
velocity dispersion decreases by a factor ~ 1 .4 from z ~ 2 to 
present-day. 

6.4. Evolution in Mass Density 
Next, we will focus on the central and effective mass den- 



sities u sing a similar approach as described in ISaracco et alj 
( 120121) . In short, using the intrinsic Sersic profile we can cal- 
culate the fraction of the luminosity that is within 1 kpc as 
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Fig. 9. — Redshift evolution in velocity dispersion. Left panel: cr e versus dynamical mass. At fixed dynamical mass, we find that our galaxies have higher 
velocity dispersion as compared to low-redshift galaxies. Middle panel: Evolution of velocity dispersions at fixed dynamical mass versus redshift. We find the 
cr e oc (1 + Z )0 50±0-<B^ • e a( . g xec j (jynajnicai mass the velocity dispersion decreases by a factor of ~ 1.7 from z ~ 2 to the present day. Right panel: Evolution 
of the velocity dispersion with an evolving mass function at constant number density. Opposite to the evolution in size, we now find a milder evolution of 
£reK(1+z) 0.36±0.08_ 



compared to th e total lumin osity. For a Sersic profile this ra- 
tio is given by dCiottill 19911) : 



Likpc _ y(2n,x) 
L to , T(2n) 



(10) 



Here, T(2n) is the complete gamma function, y(2n, x) the in- 
complete gamma function, x = b n {r\k pc lr e ) l l n , with b„ = 
1.9992n - 0.3271. Using this ratio we can now calculate the 
dynamical mass within one 1 kpc and within r e from the total 

mass: 



Akpc 



pc 



M dyn . 



(11) 



Here we assume that the dynamical mass profile follows 
the light profile, and furthermore that the mass-to-light ra- 
tio of the galaxy is radially constant. The detection of small 
color gradients in our galaxies indicates, however, that this is 
not the case, but the effect o n the derived densitie s is small 
dSaracco et alj|2012t see also fSzomoru et ai1l2012f) . Finally, 
the densities are calculated as follows: 



AM 



Plkpc = 



I kpc 



Ikpc 



and 



Pe = 



2M dyn 



3nn 



(12) 



(13) 



Similarly, we now compare the density as a function of dy- 
namical mass (see Figure [Tot. The top row shows the results 
for the density within one effective radius, while the bottom 
row compares the central density within 1 kpc. The first thing 
to notice is the large scatter for low-redshift galaxies when 
looking at p e versus M dyn , while p\k pc versus M dyn shows a 
tight relation. The density-mass relation can be parametrized 
by: 



P =Pc 



10 n M o 



(14) 



For the density within r e we find p ce = 4.7 x 10 8 M o kpc~ 3 and 
b e = -0.68 + 0.15, and for the central density within 1 kpc 
PcAkpc - 6.6 x lO 9 M kpc~ 3 and b\k pc = 0.56. 

When comparing the galaxies in our high-redshift sample to 
galaxies in the SDSS, we find that they have higher densities 
within r e . Comparison at equal dynamical mass shows that 



the effective densities are higher by a a factor of ~ 50 (Fig- 
ure [TOb ) for our sample. The same comparison, but now for 
the central density within 1 kpc, reveals only mild evolution, 
approximately factor of ~ 3 from z ~ 2. When fitting 



p«(l + z)°, 



(15) 



we find that a e = 2.95 + 0.15, while a xkpc = 0.77 + 0.15. 

Instead of comparing galaxies at fixed mass, we again take 
into consideration that galaxies evolve in mass when compar- 
ing low- and high-redshift galaxies. This time, we find that p e 
evolves even faster as compared to the equal mass comparison 
a e = 3.35 + 0.15. The density within 1 kpc, however, requires 
a decrease less than a factor of ~ 2, a\k pc = 0.44 + 0.15, to 
evolve into present day galaxies. 

7. DISCUSSION 

In the previous section we have found that in order for the 
high-redshift galaxies in our sample to evolve into typical 
present-day galaxies, strong structural evolution is required. 
Effective radii need to increase, and the velocity dispersion 
within r e has to decrease, while the density within the effec- 
tive radius has to decrease by more than an order of mag- 
nitude. However, the central density can remain almost the 
same. 

The dominant physical mechanism for this structural evo- 
lution is still a subject of ongoing debate. Size growth dom- 
inated by major mergers seems to be unlikely as it would in- 
crease the masses too much, which would make extremely 
massive galaxies too common in the local universe. As the 
mass and size increase at approximately at the same rate 
in major mergers, the galaxies would also remain too com- 
pact for their mass. Minor merging could offer a solution to 
the problem, as it ca n grow a galaxy i n effective radius (r K ) 
steeper than r, oc M* (iNaab et alJl2009t iBezanson et alJl2009t 
Hopki ris"et al.ll2009bl ). In this scenario, the observed com- 
pact high-redshift galaxies may simply be the cores of local 
massive early-type galaxies, which grow inside-out by accret- 
ing (smaller) galaxies, an d thus assemble a significant part of 
their mass at later times (Ivan der Wei et al.l 12009b lOser et aD 
2010). In this section we will examine if dry minor merging 
agrees with our findings. 

From a simple es timate, based on the virial theorem, 
IBezanson et al.l d2009l) predict how the effective radii changes 
if a massive galaxy undergoes a series of minor mergers. With 
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effective density, i.e. p e oc (1 + z) 



3.35+0.15 



while the central density requires very little to no evolution p\t pc K (1 + z) 



0.44+0.15 



only eight 1:10 mergers, the effective radii can grow by a fac- 
tor of ~ 5 while only having th e mass increase by a factor of 
~ 2. This is also described by iNaab et all (120091) . who state 
that if an initial system undergoes a mass increase by a fac- 
tor of 2 due to accretion of very small systems, then the final 
radius of the system is four times larger, the velocity disper- 
sion is reduced by a factor of 2, and the density is reduced 
by a factor of 32. This prediction is confirmed from by their 
hydrodynamical cosmological simulation which gives similar 
size evolution as presented here. 

Using hydrodynam ic simulations of galaxy mergers, 
Hopk inset al.l d2009al) . also find evidence for size evolu- 
tion. When comparing effective radii at fixed mass, they 
find an evolution in size of r e oc (1 + z)~ 48 for galaxies 
with log M„/M Q - 11, which is weaker th an found by this 
study and many others. Ose r et all d2012l) find a size evo- 
lution in their hydro simulation, which is much stronger: 
r e oc (1 + z)~ 1 44 , on the high side of current results. 

lOser et aD (120 121) find a similar evolution in velocity disper- 
sion of cr oc (1 + z) 0,44 , to that found in this work. In contrast, 
IHopkins et al.l (l2009al) . predict such systems have roughly the 
same or at most a factor ~ 1 .25 larger velocity dispersions. 

Evolution of the densi ty is also di s cussed in both 
iBezanson etaLl ( 120091) and IHopkins et ail d2009bb . Based 
on photometric data, both studies find that while the den- 



sity within one effective radius is higher at high-redshift, the 
central density of high-redshift galaxies is very si milar to lo- 
cal ma ssive ellipticals. From hydro simulations, Naab et al. 
(2009) show that the central density within lkpc decreases by 
a factor of 1.5 from z — 2 to z — 0, cau sed by dynamica l 
friction from the surviving cores. Similar, lOser et all (120121) 
show that the central density evolves only weakly, while the 
density within r e decreases rapidly by more than an order of 
magnitude, in good agreement with wh at we find here . 

This is in contrast with the findings o flSaracco et alJ d2012), 
who find no evidence for higher effective densities when com- 
paring their z ~ 1.5 galaxies to low-redshift cluster galaxies. 
Furthermore, the large scatter that they observed in the effec- 
tive density and the apparent evolution, is simply due a pecu- 
liar feature in the Sersic profile. 

In Figure^ we showed that the ratio of M m /Md y „ may have 
evolved from z ~ 2 to z ~ 0. As compared to SDSS galaxies 
with log M*/M Q > 11, we find that the median M t /M dyn is 
higher by 50% at z > 1.5, and that MJM dyn oc (1 + Z )0.17±0.04 
However, this result is still uncertain due to the selection ef- 
fects inherent in this sample and large measurement errors on 
both masses. We note that this effect is predicted by simu- 
lations; as the effective radius of a galaxy ev olves, the dark 
matter fraction within r e will also increase. Hopki ns"et alj 
(2009a) predicts evolution by a factor of ~ 1.25 for galaxies 
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with log M,/M H = 11, w ith the effect increasing with stel- 
lar mass. iHilz et all (120121) also find a strong evolution in the 
dark matter fraction in their hydro-simulation, and predict that 
quiescent galaxies at z ~ 2 have lower dark matter fractions 
(> 80%). They mention that is mainly driven by the strong 
size increase, which therefore probes a larger region that is 
dominated by dark matter. 

8. SUMMARY AND CONCLUSION 

In this paper, we present deep UV-NIR spectroscopy of 5 
massive (> 10 11 M ) galaxies at z ~ 2, using X-Shooter 
on the VLT. These spectra enable us to measure stellar ve- 
locity dispersions with higher accuracy than done before at 
this redshift: we triple the sample of z > 1.5 galaxies with 
well constrained (6cr < 100 km s _1 ) velocity dispersion mea- 
surements. We find that the stellar velocity dispersions are 
high (290-450 km s _1 ) compared to equal-mass galaxies in 
the SDSS. 

We combine these kinematic results with size measure- 
ments using GALFIT on HST-WFC3 H l60 and UDS K-band 
imaging, and use these measurements to derive dynamical 
masses. Stellar masses are obtained from stellar population 
synthesis modeling on the VIS-NIR spectra in combination 
with the available broad- and medium-band data. The SPS- 
modeling shows that our galaxies have ages ranging from 0.5- 
2 Gyr, and show no signs of on-going star formation. We com- 
plement our results with stellar kinematic results from other 
studies at low and high redshift to study the structural evolu- 
tion of massive quiescent galaxies. 

We find good correspondence between the dynamical and 
stellar masses, with the dynamical mass being higher by 
~ 15%. Our results suggest that stellar mass measurements 
for quiescent galaxies at high-redshift are robust. 

At fixed dynamical mass we find that the effective radius 
increases by a factor of ~ 2.8, while the velocity dispersion 
decreases by a factor of ~ 1.7 from z ~ 2 to the present day. 
Furthermore, we study how the mass density within r e and 
1 kpc evolves with time. While we find a strong decrease 
of the mass density within one effective radius (factor of ~ 
21), it only decreases mildly within 1 kpc (factor of ~ 2.3). 
Instead of comparing galaxies at fixed dynamical mass, we 
also use an evolving mass limit as defined by fixed number 
density. By accounting for concurrent mass growth in our 
comparison of high- and low-redshift galaxy populations, we 
find an even stronger evolution in galaxy sizes (factor of ~ 4). 
We find that velocity dispersion decreases less dramatically 
with time, differing by only a factor of ~ 1.4. Finally, for 
the mass density within r e , we find a stronger evolution, but 
interestingly, the mass density within 1 kpc is consistent with 
no evolution. 

We examine if our results are compatible with the current 
idea that massive quiescent galaxies at z ~ 2 grow inside out 
throu gh dry min o r mer gers. From hydrodynamical simula- 
tions, [Oser et al] (120121) find similar results to those shown 
here. Even though their size evolution is somewhat stronger, 
they do find a similar decrease in the velocity dispersion. Fur- 
thermore, their simulations show that the central density re- 



mains almost the same, while at the same time the effective 
density decreases by more than an order of magnitude, in 
good agreement with our findings. 

Finally, we find that even though the stellar masses are con- 
sistent with the dynamical masses, the ratio of M t /Md yn may 
decrease with time. This, too, is predicted by minor merg- 
ing simulations, which show that the size growth due to mi- 
nor merging is accompanied by an increase of the dark matter 
fraction within an effective radius. This is due to the fact that 
the dark matter profile is less steep than the stellar mass pro- 
file, and thus the dark matter to stellar mass fraction increases 
with radius. 

Despite the vastly improved accuracy of our derived 
dynamical masses and stellar population parameters, the 
broader inferences of our study is still limited by the very 
small number of high-redshift galaxies with such information. 
We have shown that our sample is biased towards younger 
galaxies, compared to a stellar mass limited sample at z ~ 2, 
with smaller effective radii as compared to the full population 
of quiescent galaxies z ~ 2. Only with a larger unbiased 
sample of massive quiescent galaxies at high redshift can we 
start to comprehend the final phase that massive galaxies go 
through in becoming today's ellipticals. 
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APPENDIX 

ROBUSTNESS OF THE VELOCITY DISPERSION MEASUREMENTS 

As it has only recently become possible to measure velocity dispersions at high redshift, the stability of these measurements has 
barely been tested. In this appendix, we will study the effect of fitted wavelength range, template choice, degree of the additive 
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Fig. 11. — Velocity dispersion versus wavelength range. Left panel shows the results for A slar , < A < 7000A, while the right panel is for 3600A < A < A enc [. 
The horizontal shaded region indicates the 1-cr error from pPXF, the dashed horizontal line indicates the velocity dispersion when fitting the full range. Most 
prominent absorption lines are indicated, and the regions affected by strong sky-lines and atmospheric absorption are shown in gray. Overall, we find a stable 
solution for the velocity dispersion while changing the wavelength range. For NMBS-C7447 and UDS-19627 we do find a small increase in the region around 
the Balmer break. For NMBS-C17637, we find that excluding Na D has a great impact on the velocity dispersion. 



polynomial, and choice of stellar populations models. 



Dependence of the Velocity Dispersion on the Wavelength Range 

Our sample has a redshift range of 1.4 < z < 2.1, which means that different parts of the rest-frame spectra will be affected 
by sky-lines and atmospheric absorption for each galaxy. This can be seen from Figure [4] which shows that we often lose strong 
absorption features in our spectrum, which affects the region of the rest-frame spectrum that we can use for the fit. Here we 
investigate how stable the measured velocity dispersion is as a function of the wavelength range. 

For the velocity dispersion fitting in this Paper, the lower wavelength limit is set by stellar libraries and models, as no systematic 
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Fig. 12. — Reduced^ 2 from SPS-modeling versus velocity dispersion. Different points indicate different template ages, while different colors show different 
metallicities. All fits are done on the full-wavelength range and with one-order per 6000km s for the additive polynomial. We find a stable solution for the 
velocity dispersion by using different templates, except for UDS-19627, which shows a large range in velocity dispersions. 

high-resolution observations exist below 3550A. The higher wavelength limit is set by lower S/N in our spectra in the observed 
K-Band. Our approach for testing the wavelength dependence of the fit is as follows. First, we use the full-range spectrum to 
determine a best-fit polynomial (1 order per 10000 kms~'), which is used to correct for the difference between the observed 
and the template continuum. Next, we repeat the measurement with a zeroth-order polynomial while changing the start or end 
wavelength. The polynomial is not a free parameter in this fit, as this would make it impossible to separate between the effect 
of the polynomial and the wavelength range. Note also, that we use a single temp late for all fits as determined from the full 
Visual+NIR spectrum together with broad- and medium-band data (see Section [3~2l i. 

FigureQT|shows the results for the different sources. The left panel shows the result where we change the starting wavelength, 
i.e. the wavelength range is A start < A < 7000A, whereas the panels on the right show the effect of changing the end wavelength 
3600A < A < A en d- The first thing to notice is that the measured dispersions are remarkably stable, even when most of the 
absorption lines have been excluded from the fit. The two galaxies with the youngest ages and strong Balmer absorption lines 
(NMBS-C7447 and UDS-19627) show a change in the velocity dispersion in the region of the Balmer break. NMBS-C7447 
shows an increase for A star , > 3800A, but decreases after Ca H&K have been removed from the fit. UDS-19627 shows an 
increase of 50 km s _1 when the Balmer break is excluded. When we reduce the red part of the spectrum (FigureQT] right panels), 
we also find a very stable fit, except for NMBS-C17637. After excluding Na D from the fit, the velocity dispersion increases by 
a ~100 kms -1 . We think this is because NMBS-C17637 has a more evolved stellar populations than say for example NMBS- 
C7447. With the Ca H&K lines masked out due to atmospheric absorption, Na D is one of the strongest lines in the spectrum, 
and its exclusion could explain the sudden change in the measured dispersion. 

To summarize, for most galaxies we find no strong dependence on the wavelength range that is used in the fit. We do find that 
with decreasing wavelength range, the random error increases. Finally, even in the absence of strong absorption features like Ca 
H&K, we find similar velocity dispersions as compared to the full range fit. 



Dependence of the Velocity Dispersion on the Template Choice 

Next, we study how different templates may influence the measured velocity dispersion. We use a sample of BC03 r-models, 
as presented in Section EOl In particular, we are interested in the effect of template age and metallicity. In Figure[T2]we show the 
reduced^ 2 from the SPS-modeling, versus the velocity dispersions measured using this template. The different points represent 
different ages of the templates, with the minimum x corresponding to the best-fit age as listed in Table [2] Different colors 
indicate the different metallicities. We show the^- 2 from the SPS-modeling instead of the dispersion fit, as the former is derived 
from the full Visual and NIR spectrum plus all the broad- and medium-band photometric data. This yields better constraints for 
the stellar population, thus larger relative ranges in % 2 as compared to the x 1 from the dispersion fit. Also, as we add high-order 
additive polynomials to the templates before fitting the velocity dispersion (in this case 1 order per 6 00010ns- 1 ), the effect of 
different template-ages is mostly washed out, and we get a small relative^ 2 . 

In Figure Q~2] we see that for most galaxies the velocity dispersion for templates allowed within 2<r give consistent results. 
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Fig. 13. — Reduced^ 2 versus velocity dispersion, similar to Figure [12] but now excluding the Balmer break (A > 4020A). Whereas with the full wavelength 
range UDS 19627 showed a large variation in the velocity dispersion as a function of template, this time we do find a stable solution. The other sources also, 
show a decrease in the uncertainty due to different templates, except for UDS-29410. However, this is due to lower S/N of this spectrum and the shorter fitted 
wavelength range as before. 



Different metallicities do give different velocity dispersions at their minimum^ 2 , but this is a mere reflection of the age-metallicity 
degeneracy. Different metallicities have different best-fitting ages, which in turn give different velocity dispersion. At a 1 - <x 
level, we only have a handful of best-fitting templates, from which we obtain similar velocity dispersions. UDS-19627 is the 
exception, which shows a large dependency of the measured velocity dispersion as a function of template age. At the 1 — cr 
level, we find a range of ~ 30kms~', due to template uncertainty, while the random error is one of the lowest, only ~ 30km s . 
Even though templates with x 1 of 1 .20 or higher are statistically considered a bad fit, the large range in the velocity dispersion 
is worrying. If we could not have constrained the best-fitting template from the full range spectrum and broad band data, the 
measured velocity dispersion would be highly uncertain. 

However, if we exclude the Balmer break from the velocity dispersion fit, the dependency on template age almost completely 
disappears (Figure |T3l>. The case of UDS-19627 is the most dramatic one, where we suddenly see a tight range of best-fitting 
velocity dispersions. We do note that the velocity dispersion has increased, as was already shown in Figure [TT] UDS-29410 
appears to have a slightly higher template uncertainty if we only fit for A > 4020 A, but this is driven by the lower S/N of the 
spectrum as we now use a shorter wavelength range. 

To conclude, we find a systematic uncertainty due to templates with different ages. This is caused by the Balmer break, present 
in the relatively young galaxies in our sample. By only fitting for A > 4020 A, the uncertainty due template mismatch almost 
completely disappears. In that case, templates with different metallicities do give different velocity dispersions, but this is most 
likely caused by the underlying age-metallicity degeneracy. 

Dependence of the Velocity Dispersion on the Order of the Additive Polynomial 

In order to correct for stellar continuum emission differences between the observed galaxy spectrum and the template, we use an 
additive Legendre polynomial. If we would not apply such a correction, the fitting routine could try to correct for this discrepancy 
by changing the velocity dispersion. Values that are typically used in the literature vary from 5000 to 15000 km s _1 per order. We 
do not use multiplicative Legendre polynomials, because the S/N of the spectra are too low, and it would add another degree of 
uncertainty to the fit. Here, we test the influence of the additive polynomial to our measured velocity dispersions. Again we use 
the best-fit T-model as a template while varying the additive polynomials from to 50. We fit both the full-wavelength range and 
the wavelength range with A > 4020 A. 

Figure [14] shows the results, with the blue line representing the full-wavelength fit, while the red line shows the results for 
A > 4020 A. The vertical dashed line indicates the polynomial with one order per 10000 kms -1 . Overall we find that by 
increasing the additive polynomial, the measured velocity dispersion increases. In general we find that between a polynomial of 
10 and 30, the smallest increase in the velocity dispersion occurs, and this appears to be the most stable region. For this reason 
we use one order per 10000 km s _1 for our science results. 
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Fig. 14. — Velocity dispersion as measured with different order additive polynomials. The full-wavelength range fit is shown in blue, while the fit with A > 4020 A 
is shown in red. We find an increase of the velocity dispersion with increasing order. The fit is most stable when using a polynomial with order between 10 and 
30. The vertical dashed lines shows the order that corresponds to 10000 km s , the polynomial that we use for obtaining science results. 



Dependence of the Velocity Dispersion on the SPS-models and Template Construction 

Here, we test how our velocity dispersions would change if we make different choices for the SPS-model, and test the difference 
between a single r-model and a template constructed from different SSP models. 

The left and middle panel of Figure [15] sho w what would hap pen if we would choose the Flexible-SPS models bv lConrov et alj 
( 2009) (CO09) or the models by Maraston & Strombackl (l201 ll) (MAI 1). These models a re based on a different st ellar library with 
slight ly higher resolution as compared to BC03, i.e. MILES dSanchez-Blazquez et al.ll2006h versus STELIB dLe Borgne et alj 
2003). If a systematic uncertainty in the measured velocity dispersion is present, for example due to the resolution or details 
that go into the SPS-models, it would show up in this comparison. We determin ed a best-fit r model using the CO09 and MAI 1 
models in exactly the same way as was done for the BC03 models (See Section l3~2l . When comparing the velocity dispersions 
derived by using the BC03 and CO09 r-models, we do not find any significant difference (right panel of Figure[l3Tl. In the middle 
panel we compare MAI 1 and BC03. Here too, we find a good correspondence, but for UDS 19627 the MAI 1 models give a lower 
velocity dispersion, but still within the 1 — cr error. These results confirms that our measurements are stable against different 
SPS-models and templates with different resolution. 

Finally, in Figure Q3] (right panel) we compare the velocity dispersion from the best-fit r-model, versus an optimal template 
constructed by pPXF This optimal template is build from a full spectral library from BC03 Single Stellar Population (SSP) 
models, with full range in age and metallicity. We note that the optimal template construction only uses the wavelength regime 
provided in the dispersion fit (3600 A < X > 7000 A) , and does not take into account the effects of dust. The velocity dispersions 
from the optimal templates are slightly higher as compared to the single r-model, although it is well below the random error. 
Interestingly, the galaxy with the largest dust-contribution, NMBS-C17637, also shows the largest difference between the two 
different fitting techniques. In this paper, we choose to use the best-fitting r - model as the dispersion template, as this is the 
best representation of the stellar population. As the stellar mass is also based on this r-model, we use the same stellar population 
when comparing stellar to the dynamical mass. 

APERTURE CORRECTIONS FOR VELOCITY DISPERSION MEASUREMENTS 

Here, we investigate the effects of different apertures and extraction methods on the observed velocity dispersion. The standard 
approach is to use a power-law to scale the observed velocity dispersion, measured within a certain r aper , to the velocity dispersion 
as if measured within r e /%, using the following expression by Jorgensen et al. (1995): 
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Fig. 15. — Compa rison of the velocity dispersion when using different methods. Left panel: Velocity dispersion as derived with the models of BC03 and 
Conroy et al. 1 2009). Even though the models are based on different stellar libraries with different resolution, the velocity dispersion measured with these models 
are consistent. Middle panel Similar as the left panel but now using BC03 and Maraston & Stromback ( 201 1) for the comparison. In general we find consistent 
results within the errors, but UDS 19627 has a lower velocity dispersion when we use the MAI 1 models. Right panel: Velocity dispersion as derived with optimal 
template construction versus the best-fitting r-model. The optimal template is constructed from BC03 SSP models with a full range in age and metallicity. 
Dispersions from the optimal template construction are slightly higher, but well within the errors. 

Or using a more recent result from the SAURON survey (Cappellari et al. 2006): 
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In the case of Jorgensen et al. (1995), the velocity dispersions have been measured in the range of -1.5 < log(r aper /r e ) < -0.5, and 
for Cappellari et al. 2006 for -1.0 < log(r aper /r e ) < 0. In this work however, we are outside this range with \og(r aper / r e ) > 0.5, 
that is r e being much smaller than r aper . Also, the galaxies in our sample have effective radii much smaller than the FWHM of 
the PSF, which is why the standard approach most likely will not be valid. 

We analyze this problem in two steps. Firstly, we use an analytic description for the kinematic profile and match this to the 
observed relation as found by Cappellari et al. (2006). Secondly, using our model we study the behavior of the observed velocity 
dispersion for different apertures (both circular and non-circular) and different FWHM for the PSF. Our reference model will be 
a circular aperture with size r e , without the effects of seeing. 
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Fig. 16. — Observed kinematic profile in a circular aperture. Best fit is d = -0.089 



From Figure [18]it is clear, that by using a simple power law for the aperture correction, we would overestimate the corrections 
by a large fraction. Furthermore, we have shown that it is vital to use a suitable aperture and include the effects of seeing, 
especially when r &vel /r e > 2. 
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Fig. 17. — Observed kinematic profile in a circular aperture, while varying the FWHM of the psf (red) and the Cappellari et al. relation (dashed). The black line 
shows our reference model, i.e. circular aperture with no influence of the PSF. 



A good description of the kinematic profile for early galaxies is given by (see e.g. True et al. 1999, Bertin et al 2002) : 
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with -0.1 < d < 0. The observed kinematic profile within a circular aperture will be a projection of <x(r) 2 and the galaxies 
intensity profile : 
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Here, I ga i(r) is the Seisic Profile: 
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To avoid numerical issues in the center, we approximate eq. IB3l and eq. IB5l bv: 
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where r core is chosen to be 1 /30r e . We can estimate the power d by evaluating Eq. IB4l for different values of d and comparing the 
results to Eq. 1 from Cappellari et al. (2006). 

Figure [16] shows the Cappellari et al. relation (dashed) and our model with different values for d, normalized to r e . We find a 
best fitting value for d = -0.089 i n the region of (-1.0 < log{r aper / r e ) < 0. Notice that for log(r aper / r e ) > 0.0 our model deviates 
from the simple power law in Eq. IB3I 

Now that we have found the intrinsic kinematic profile, we can explore the influence of the PSF on the observed dispersion. 
The PSF is modelled using a combination of two gaussians, where <j\ = 2o"2, and both gaussians having equal flux (reference). 
This PSF is then convolved with the kinematic and intensity profiles: 
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The results for PSFs with different values of <x are shown in Figure [T71 

Instead of using a circular aperture, we now consider the case of a rectangular aperture, similar to what is used in the spectra 
from X-Shoot er. E q. IB8l can be modified to include a weight function g(y) which is commonly used in optimized extraction. The 
integral in Eq. |B9| is also replaced by a Riemann sum. In the x direction (slit width) the aperture size is always the same, i.e 0.9" 
for X-Shooter spectra, but the y direction is now parallel to r aper . 
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Figure [18] shows the difference between using a circular, rectangular, and rectangular aperture including a weighting function. 
The slit width that was used is 0.9", with the FWHM of the PSF also being 0.9". The spectrum was extracted with r aper = 0.45". 
We see that the correction is slightly higher for the rectangular aperture as compared to the circular aperture at r aper /r e = 2.25. 
The behavior at r aper /r e > 3 is very different for the three different cases. When using a rectangular aperture with optimized 
extraction, the observed profile is flatter as compared to the other models. The corrections are on average between 3 and 5%. 
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Fig. 18. — Observed kinematic profile in a different apertures: circular, rectangular, and rectangular including a weight function. Dashed line is the Cappellari 
et al. relation, solid line is our reference model (circular aperture, no PSF), and the red line is the modeled kinematic including the effect of the PSF. The vertical 
blue dashed line shows a typical r aper /r e for our observations. We note that r apcr is in the direction along the slit. 
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